Off-Policy Knowledge Maintenance for Robots

نویسندگان

  • Joseph Modayil
  • Patrick M. Pilarski
  • Adam White
  • Thomas Degris
  • Richard S. Sutton
چکیده

A fundamental difficulty in robotics arises from changes in the experienced environment—periods when the robot’s current situation differs from past experience. We present an architecture whereby many independent reinforcement learning agents (or demons) observe the behaviour of a single robot. Each demon learns one piece of world knowledge represented with a generalized value function. This architecture allows the demons to update their knowledge online and off-policy from the robot’s behaviour. We present one approach to active exploration using curiosity—an internal measure of learning progress—and conclude with a preliminary result showing how a robot can adapt its prediction of the time needed to come to a full stop.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive policy of buffer allocation and preventive maintenance actions in unreliable production lines

The buffer allocation problem is an NP-hard combinatorial optimization problem, and it is an important design problem in manufacturing systems. The research proposed in this paper concerns a product line consisting of n unreliable machines with n − 1 buffers and a preventive maintenance policy. The focus of the research is to obtain a better trade-off between the buffer level ...

متن کامل

A reliability-based maintenance technicians’ workloads optimisation model with stochastic consideration

The growing interest in technicians’ workloads research is probably associated with the recent surge in competition. This was prompted by unprecedented technological development that triggers changes in customer tastes and preferences for industrial goods. In a quest for business improvement, this worldwide intense competition in industries has stimulated theories and practical frameworks that ...

متن کامل

Reliability Based Optimal Preventive Maintenance Policy for High Voltage Circuit Breakers in Power Plants

  Electric power industry have always try to provide reliable electricity to customers and at the same time decrease system costs. High Voltage circuit-breakers are an essential part of the power network. This study has developed a maintenance and replacement scheduling model for high voltage circuit- breakers that minimize maintenance costs while maintaining the acceptable reliability. This mo...

متن کامل

Methods to choose the best Hidden Markov Model topology for improving maintenance policy

Prediction of physical particular phenomenon is based on partial knowledge of this phenomenon. Theses knowledges help us to conceptualize this phenomenon according to different models. Hidden Markov Models (HMM) can be used for modeling complex processes. We use this kind of models as tool for fault diagnosis systems. Nowadays, industrial robots living in stochastic environment need faults dete...

متن کامل

Building a maintenance policy through a multi-criterion decision-making model

A major competitive advantage of production and service systems is establishing a proper maintenance policy. Therefore, maintenance managers should make maintenance decisions that best fit their systems. Multi-criterion decision-making methods can take into account a number of aspects associated with the competitiveness factors of a system. This paper presents a multi-criterio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010